STSM: Spatio-Temporal Shift Module for Efficient Action Recognition
نویسندگان
چکیده
The modeling, computational complexity, and accuracy of spatio-temporal models are the three major foci in field video action recognition. traditional 2D convolution has low but it cannot capture temporal relationships. Although 3D can obtain good performance, is with both high complexity a large number parameters. In this paper, we propose plug-and-play Spatio-Temporal Shift Module (STSM), which effective high-performance module. STSM be easily inserted into other networks to increase or enhance ability network learn features, effectively improving performance without increasing parameters complexity. particular, when CNNs integrated, new may features outperform based on convolutions. We revisit shift operation from perspective matrix algebra, i.e., sparse kernel. Furthermore, extensively evaluate proposed module Kinetics-400 Something-Something V2 datasets. experimental results show effectiveness STSM, recognition also achieve state-of-the-art two benchmarks.
منابع مشابه
Spatio-temporal SURF for Human Action Recognition
In this paper, we propose a new spatio-temporal descriptor called ST-SURF. The latter is based on a novel combination between the speed up robust feature and the optical flow. The Hessian detector is employed to find all interest points. To reduce the computation time, we propose a new methodology for video segmentation, in Frames Packets FPs, based on the interest points trajectory tracking. W...
متن کاملHuman Action Recognition Using Spatio-temporal Classification
In this paper a framework “Temporal-Vector Trajectory Learning” (TVTL) for human action recognition is proposed. In this framework, the major concept is that we would like to add the temporal information into the action recognition process. Base on this purpose, there are three kinds of temporal information, LTM, DTM, and TTM, being proposed. With the three kinds of proposed temporal informatio...
متن کاملSpatio-temporal Aware Non-negative Component Representation for Action Recognition
This paper presents a novel mid-level representation for action recognition, named spatio-temporal aware non-negative component representation (STANNCR). The proposed STANNCR is based on action component and incorporates the spatial-temporal information. We first introduce a spatial-temporal distribution vector (STDV) to model the distributions of local feature locations in a compact and discri...
متن کاملImproved Spatio-temporal Salient Feature Detection for Action Recognition
Spatio-temporal salient features localize the local motion events and are used to represent video sequences for many computer vision tasks such as action recognition. The robust detection of these features under geometric variations such as affine transformation and view/scale changes is however an open problem. Existing methods use the same filter for both time and space and hence, perform an ...
متن کاملGenetic Programming-Evolved Spatio-Temporal Descriptor for Human Action Recognition
The potential value of human action recognition has led to it becoming one of the most active research subjects in computer vision. In this paper, we propose a novel method to automatically generate low-level spatio-temporal descriptors showing good performance, for high-level human-action recognition tasks. We address this as an optimization problem using genetic programming (GP), an evolution...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Mathematics
سال: 2022
ISSN: ['2227-7390']
DOI: https://doi.org/10.3390/math10183290